Barycentric Approximator for Reinforcement Learning Control

نویسنده

Whang Cho

چکیده

Recently, various experiments to apply reinforcement learning method to the self-learning intelligent control of continuous dynamic system have been reported in the machine learning related research community. The reports have produced mixed results of some successes and some failures, and show that the success of reinforcement learning method in application to the intelligent control of continuous control systems depends on the ability to combine proper function approximation method with temporal difference methods such as Q-learning and value iteration. One of the difficulties in using function approximation method in connection with temporal difference method is the absence of guarantee for the convergence of the algorithm. This paper provides a proof of convergence of a particular function approximation method based on "barycentric interpolator" which is known to be computationally more efficient than multilinear interpolation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Barycentric Interpolators for Continuous Space and Time Reinforcement Learning

In order to find the optimal control of continuous state-space and time reinforcement learning (RL) problems, we approximate the value function (VF) with a particular class of functions called the barycentric interpolators. We establish sufficient conditions under which a RL algorithm converges to the optimal VF, even when we use approximate models of the state dynamics and the reinforcement fu...

متن کامل

2 Definition of Barycentric Interpolators

In order to nd the optimal control of continuous state-space and time reinforcement learning (RL) problems, we approximate the value function (VF) with a particular class of functions called the barycentric interpolators. We establish su cient conditions under which a RL algorithm converges to the optimal VF, even when we use approximate models of the state dynamics and the reinforcement functi...

متن کامل

Path-Tracking Control of a Non-Holonomic Car-Like Robot with Reinforcement Learning

The problem investigated in this paper is that of driving a car-like robot along a race track and the use of reinforcement learning to find a good control function. The reinforcement learner uses a case-based function approximator to extend the reinforcement learning paradigm to handle continuous states. The learned controller performs similar to the best control functions in both simulation an...

متن کامل

Control of Inverted Double Pendulum using Reinforcement Learning

In this project, we apply reinforcement learning techniques to control an inverted double pendulum on a cart. We successfully learn a controller for balancing in a simulation environment using Qlearning with a linear function approximator, without any prior knowledge of the system at hand. We do however fail to learn a controller for the swingup maneuver, which leads to a discussion on what mig...

متن کامل

Reinforcement learning on an omnidirectional mobile robot

With this paper we describe a well suited, scalable problem for reinforcement learning approaches in the field of mobile robots. We show a suitable representation of the problem for a reinforcement approach and present our results with a model based standard algorithm. Two different approximators for the value function are used, a grid based approximator and a neural network based approximator.

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Barycentric Approximator for Reinforcement Learning Control

نویسنده

چکیده

منابع مشابه

Barycentric Interpolators for Continuous Space and Time Reinforcement Learning

2 Definition of Barycentric Interpolators

Path-Tracking Control of a Non-Holonomic Car-Like Robot with Reinforcement Learning

Control of Inverted Double Pendulum using Reinforcement Learning

Reinforcement learning on an omnidirectional mobile robot

عنوان ژورنال:

اشتراک گذاری